a 



X 



Chapter 1 

Jan Tinbergen's legacy for economic networks: 

from the gravity model to quantum statistics 
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•^ Abstract Jan Tinbergen, the first recipient of the Nobel Memorial Prize in Eco- 

^^ nomics in 1969, obtained his PhD in physics at the University of Leiden under the 

^_^ supervision of Paul Ehrenfest in 1929. Among many achievements as an economist 

after his training as a physicist, Tinbergen proposed the so-called Gravity Model of 
|~^ international trade. The model predicts that the intensity of trade between two coun- 

^— ( tries is described by a formula similar to Newton's law of gravitation, where mass 

^^ is replaced by Gross Domestic Product. Since Tinbergen's proposal, the Gravity 

Model has become the standard model of non-zero trade flows in macroeconomics. 

However, its intrinsic limitation is the prediction of a completely connected net- 
I work, which fails to explain the observed intricate topology of international trade. 

'-^, ' Recent network models overcome this limitation by describing the real network as a 

member of a maximum-entropy statistical ensemble. The resulting expressions are 
I— I formally analogous to quantum statistics: the international trade network is found 

_K^ to closely follow the Fermi-Dirac statistics in its purely binary topology, and the 

recently proposed mixed Bose-Fermi statistics in its full (binary plus weighted) 



^vj structure. This seemingly esoteric result is actually a simple effect of the hetero- 

^^ geneity of world countries, that imposes strong structural constraints on the network. 

.• Our discussion highlights similarities and differences between macroeconomics and 

(^\ statistical-physics approaches to economic networks. 
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1.1 Introduction 

Over the last fifteen years, there has been an ever-increasing interest in the study of 
networks across many scientific discipHnes, from physics to biology and the social 
sciences [1]. Economics is no exception. Empirical iJlIll and theoretical |4, 5 1 anal- 
yses of economic networks have been growing steadily, gradually encompassing 
different scales: from 'microscopic' networks of individual agents and financial as- 
sets ||4]|6|, through 'mesoscopic' networks of firms, banks and institutions IJUlllHl, 
to 'macroscopic' networks of world countries and economic sectors I9l [TOl [TTIl . 
An unprecedented element of continuity across these different economic scales has 
been the search for empirical laws characterizing real-world networks and the sub- 
sequent introduction of simple models aimed at reproducing the observed 'stylized 
facts'. Placing observations, rather than theoretical postulates, at the starting point 
of scientific investigations is probably the main positive outcome of the interaction 
between economists and physicists, an interaction that - over the last two decades 
- has given rise to the controversial field of 'Econophysics'. The interdisciplinary 
study of economic networks is another very fruitful result of this interaction. The 
added value of using the network approach to economic problems is the possibility 
to investigate indirect effects arising as the combination of many pairwise inter- 
actions between economic agents or units. The prototypical example is the study 
of systemic risk, i.e. the risk of a system-wide cascade of defaults of banks or in- 
stitutions connected to each other in a financial network, as opposed to traditional 
measures of risk for single financial entities. 

Despite the 'network approach' is relatively recent, much earlier studies in Eco- 
nomics akeady recognized the importance of (what we now call) socio-economic 
networks, even if this knowledge was more or less dispersed across sub-fields that 
used to be largely disjoint. An important example is the so-called Gravity Model 
lfl2J . The name originates from a loose analogy with Newton's law of gravitation, 
which states that the gravitational force between two objects is proportional to the 
product of their masses, and inversely proportional to the square of the distance 
between them. Strictly applying the analogy to the economic setting, the Gravity 
Model (see lfT2l for an excellent review) assumes that a 'mass' 5, of goods (or ser- 
vices, or factors of production such as labor) supplied at an origin / is 'attracted' 
to a mass Dj of demand for such goods located at a destination j. This attraction 
generates a flow Fij of goods, but the flow is reduced by the geographic distance dij 
between origin and destination as follows 

SiDj 
Fij^K^. (1.1) 

ij 

where K is a global free parameter to be fitted to real data. The Gravity Model 
predicts larger fluxes between closer and 'bigger' (in terms of the size of supply and 
demand) locations, exactly in the same way as the gravitational force is stronger 
between closer and more massive objects. 
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The use of the Gravity Model was pioneered by Ravenstein ifTSl in studies of 
migration patterns, where flows represent movements of people, and 5, and Dj are 
mainly determined by the sizes of the two populations located at the origin and 
destination. Jan Tinbergen, the first recipient of the Nobel Memorial Prize in eco- 
nomics, was instead the first to use the Gravity Model to explain international trade 
flows lfT4ll . In this case, flows represent movements of goods among world countries, 
and Si and Dj are expected to be determined by the values of the Gross Domestic 
Product (GDP) of the countries of origin and destination, i.e. 5, = D, = GDPj. 

Indeed, it is precisely as the first data-driven model of trade that the Gravity 
Model acquired its great popularity, presumably because of the accuracy with which 
it predicts observed trade fluxes. On the other hand, international trade is also one 
of the best examples of economic systems that have been intensively studied, us- 
ing a completely different 'network' approach, over the last decade. This coinci- 
dence makes the International Trade Network a useful example to compare tradi- 
tional (economic) and recent (network) approaches when applied to the same sys- 
tem, which in this case is also an empirically well documented one. For this reason, 
in what follows we will focus on Jan Tinbergen's Gravity Model of trade, its suc- 
cesses and limitations, and the more recent approaches that overcome some of these 
limitations. As another curious coincidence, these very different frameworks have 
a common element: the modeling of an economic network in close analogy with 
physical laws, from gravitation to statistical physics and quantum statistics. 



1.2 Jan Tinbergen and the Gravity Model of trade 

In a slightly (and not fully) generalized form, the Gravity Model of international 
trade states that the expected amount of trade from country / to country j is 

GDP" GDP^ 

{yvij}=K '-y. ^ (1.2) 

where dij is the geographic distance between countries, and a, (5 and 7 are addi- 
tional (besides K) free parameters. The angular brackets in eq.( |1.2| i denote an ex- 
pected value: this means that the model is not intended to be a deterministic one, 
since real data will obviously deviate from the postulated expression. So, strictly 
speaking, the model predicts that the realized amount of trade is Wij — (wij) + e 
where e is an error term with zero mean (if a linear regression to the observed trade 
flows is used), or alternatively Wjj — (wij) ■ 77 where 77 is an error term with unit 
mean (if a linear regression to the logarithm of the observed trade flows is used). 
In both cases, the fitted values of the parameters are usually around 0: « j3 w 7 « 1 
lfT2l . Further extensions of eq.( |1.2| l include additional factors either favouring or 
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suppressing trade [^Despite the inclusion of these additional factors improves the fit, 
the main factor determining trade flows remains the GDP, followed by geographic 
distances. So eq.( |1.2| , with exponents a ~ j3 ~ yw 1, captures the basic lesson 
learnt from real trade data and makes the Gravity Model closer to the expression for 
the gravitational energy (7=1) than to the one for the gravitational force (7=2). 

Obviously, there is absolutely nothing fundamental in the formal analogy be- 
tween the empirical laws of trade (or any other economic flux) and gravity, and 
no profound reason why these laws should bear any mathematical similarity at all. 
Rather, the deep similarities must be looked for at different levels: 



A first analogy involves the implicit use of symmetry in both cases: both eq. (1.2 1 
and Newton's law, state that, all else being equal, only mass/GDP and distance 
determine the amplitude of the interaction. In physics, this means that Newton's 
law holds in vacuum, i.e. in absence of anything else that can interact gravita- 



tionally with the two objects. In macroeconomics, this means that eq.( 1.2 1 holds 
in absence of other factors affecting trade, such as the additional regressors we 
mentioned in footnote [T] 

• A second similarity is the qualitative dependence on the key quantities: both 
laws assume that interaction amplitudes increase with increasing mass/GDP, and 
decrease with increasing distance. In principle, there is an infinity of quantita- 
tive ways (functional forms) to implement this qualitative idea. Accidentally, the 
functional forms describing gravitation and trade turn out to be very similar, but 
their qualitative analogy would have held even if the two mathematical expres- 
sions were different. In some sense, this makes the qualitative analogy more fun- 
damental than the mathematical one. 

• The above consideration leads us to a third analogy, i.e. the phenomenological 



character common to eq. (1.2 1 and Newton's formula. In both laws, the partic- 
ular functional form that implements the previous theoretical arguments is es- 
tablished on the basis of its success in reproducing real data, and thus a poste- 
riori. Other functional forms, while possible a priori on the basis of the above 
two points, must be discarded if they do not explain observations. Only after 
they were widely accepted as powerful empirical laws explaining observations, 
Netwon's law and the Gravity Model became the 'target' of more general and 
abstract theories. For instance, to be acceptable, Einstein's theory of General 
Relativity must reduce to Netwon's law in the appropriate circumstances, and 
micro-founded economic models must generate the Gravity Model when aggre- 
gated at the macro level 1 12 1. 

In our view, the above epistemological analogies are even more fundamental than 
the (accidental) mathematical analogy between eq.( |1.2| i and Netwon's law. Another 

' Examples of favouring factors are: trade agreements, membership to common economic groups, 
shared geographic borders, common currency, etc. Examples of suppressing factors are: embar- 
goes, trade restrictions, and other factors representing a trade friction. 
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Fig. 1.1 Ehrenfest's students, 
Leiden 1924. Left to right: 
Gerhard Heinrich Dieke, 
Samuel Abraham Goudsmit, 
Jan Tinbergen, Paul Ehren- 
fest, Ralph Kronig, and Enrico 
Fermi (copyright @ Chicago 
University Press). 




deep connection between physics and economics exists at a personal level: Jan Tin- 
bergen, the founder of the Gravity Model of trade, was a physicist before becoming 
an economist. 

Bom in Den Haag, the Netherlands, in 1903, Jan Tinbergen started his studies in 
mathematics and the natural sciences at the University of Leiden, soon after gradu- 
ating from high school in 1921 with the highest honors. In Leiden, he later started 
a PhD in physics under the supervision of Paul Ehrenfest, who was then professor 
in Theoretical Physics (see fig |l.l| l. Tinbergen became Ehrenfest's assistant, the pri- 
vate tutor of Ehrenfest's son, and a frequent visitor of Ehrenfest's house, that was 
regularly visited also by Einstein, Bohr, Heisenberg, Fermi and Pauli. Tinbergen had 
always been attracted by economics, and Ehrenfest was interested in the analogies 
between economics and physics. This resulted in Tinbergen's PhD thesis, entitled 
'Minimum Problems in Physics and Economics' and defended in 1929. Shortly af- 
ter, despite Ehrenfest had repeatedly tried to convince him to remain a physicist, 
Tinbergen started a brilliant career as an economist. His pioneering views led him 
to introduce Econometrics, a synthesis between mathematics, economic theory and 
statistics. In Tinbergen's view, economic theory should formulate hypotheses trans- 
lated into mathematical relations that are then statistically tested on empirical data. 
This distinctive quantitative approach was almost surely due to Tinbergen's graduate 
training as a physicist. His idea of introducing a quantitative model of international 
trade flows is clearly in line with this approach. Jan Tinbergen's career culminated 
in 1969 when he received the first Bank of Sweden Prize in Economic Sciences in 
Memory of Alfred Nobel or shortly Nobel Memorial Prize in Economics, often mis- 
takenly referred to as the 'Nobel Prize in Economics' (that, strictly speaking, does 
not exist). 

As a physicist, Jan Tinbergen of course knew Netwon's law very well, a knowl- 
edge that might have facilitated making a mathematical connection to the study 
of international trade. But, we believe, his idea of using the Gravity Model in his 
research as an economist was most probably triggered by the deeper similarities. 
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discussed above, at the epistemological level. Jan Tinbergen's familiarity with the 
scientific method universally used in physics is probably the reason why his many 
achievements as an economist are all characterized by a strong quantitative approach 
and a clear focus on empirical data. Without trying to distort scientific and personal 
history, we might therefore presume that Tinbergen's view was not too far from 
what, in modem jargon, are the inspiring concepts of 'Econophysics'. His Grav- 
ity Model of trade can also be regarded as the first model of the system that, in 
the more recent Complex Networks literature, has been intensively studied under 
the names of 'International Trade Network' (ITN) or 'World Trade Web' (WTW) 
191 [15] [l6l [TTl [IS] [l9l. Therefore, in our view, Jan Tinbergen's pioneering work 
deserves full attention from the scientific community active in Econophysics and 
Network Science. Along these lines, the Gravity Model and the ITN, both still un- 
der intense investigation, can be considered two brilliant examples of how ideas 
from physics can fruitfully interact with economic problems. 



1.3 The network approach 

At the time of Tinbergen's analyses, data about international trade flows were of 
course much less accurate than today. Missing data were the rule rather than the ex- 
ception, so it was practically impossible to distinguish between the absence of data 
documenting an existing trade relationship and a 'true' absence of the relationship 
itself in the real world. While this confusion still cannot be completely eliminated, 
in modern databases 120) 1211 it only affects a few percent of the data. A simple 
analysis of such databases yields to a systematic result; in each yearly snapshot of 
the ITN from the 50's until now, only 50-60% of the total pairs of world countries 
are found to be connected by trade relationship 19] [T6ll . With little error, pairs of 
countries that do not trade at all are the remaining fraction. 

If we look again at eq. ( |1.2[ l, we immediately see that the observation of a 'half- 
connected world' is in contrast with the predictions of the Gravity Model, as it 
cannot predict zero trade flows rl Exactly as the gravitational force between any two 
masses (no matter how small or distant) is never zero according to Newton's law, 
the Gravity Model predicts that trade exchanges between any two countries (no 
matter how poor or distant) are always positive. However, while any two massive 
objects are indeed found to be attracted over cosmological distances in our Universe, 
the observation of an economically half-connected world implies that the Gravity 
Model fails in reproducing the missing links of the world trade networkrVivi other 



^ Strictly speaking, tlie introduction of an error term into eq.i 1 .2 > allows to have zero or even neg- 
ative values. However, after fitting the model to the data, or simply in order to avoid the generation 
of precisely those unrealistic negative values, the variance of the eiTor term is so small that zero 
trade flows have a vanishing probability. 

' In principle, also this limitation can be overcome if the Gravity Model is extended into the 
so-called zero-inflated models 1221 1231 that use eq.l |1.2[ l (or its generalizations) in a two-step pro- 
cedure: first in order to estimate the probability of a trade connection, and then in order to estimate 
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Fig. 1.2 The heterogeneous topology of the binary World Trade Web. Left: average nearest neigh- 
bour degree k"" as a function of the degree k, for all vertices. Right: clustering coefficient c as a 
function of the degree k. Data: UNCOMTRADE database |21 1, year 2001 {N = 162 countries). 



words, if the set of existing connections (i.e. the topology) of the ITN is preHminarly 
specified, then the Gravity Model succeeds in reproducing the magnitude of trade 
connections. But in general, it fails in reproducing the observed topology of the 
network. 

It is interesting to notice that the awareness of the importance of the binary topol- 
ogy of a network, besides that of weighted structural properties, is a recent conquest 
of Network Science - clearly absent at the time of Tinbergen. Motivated by this 
awareness, the analyses of the ITN carried out over the last decade have documented 
an intricate and heterogeneous topology. Let us for instance consider the undirected 
version of the ITN, where two countries (the nodes, or vertices, of the network) are 
connected by a link (or edge) if there exists at least a trade relationship (in any di- 
rection) between them. In this network, the number of connections (the so-called 
degree, denoted by k) of world countries is found to be very broadly distributed, 
with poor countries having only one or two connections (typically including the 
USA) and rich countries being connected to a significant number of partners, up to 
the total number of countries in the world. This result is very robust, since the degree 
is found to systematically increase with the GDP 19|. Moreover, (anti)correlations 
between the degrees of two trading partners are significant: the average degree of 
trade partners (the average nearest neighbour degree, denoted by k™) is smaller for 



countries with larger degree |9, 15| (see fig. 1.2i. This means that more connected 
(richer) countries trade with countries having on average a smaller number of part- 
ners, and less connected (poorer) countries trade with countries having on average a 
larger number of partners. A similar result holds for the so-called clustering coeffi- 
cient (denoted by c) of a country, defined as the realized fraction of links (local link 
density) among the partners of that country. Just like k™, c is found to decrease as k 
increases, meaning that more connected countries have a less interconnected neigh- 
bourhood, and less connected countries have a more interconnected neighbourhood 
El Us I (see fig. |1.2[ ). All these topological properties can be generalized to the di- 

the intensity of the connection. However, recent analyses |23| have shown that this procedure pro- 
vides a bad fit to the observed network: when used in order to estimate link probabilities and link 
intensities simultaneously, the Gravity Model turns out to be a very bad model. 
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Fig. 1.3 The nontrivial structure of the weighted World Trade Web. Left: average nearest neigh- 
bour strength s"" as a function of the strength s, for all vertices. Right: weighted clustering coeffi- 
cient c" as a function of the strength s. Data: UNCOMTRADE database (21], year 2001 (N = 162 
countries). The values are rescaled by the total weight. 



reeled version of the network (where hnks follow the direction of, say, exports), and 
similar results are found lfT6l [TSl . 

Even when weighted properties of the network are studied, the importance of the 
underlying topology is still manifest, e.g. when local averages of weighted quantities 
are performed. For instance, let us consider the weighted analogue of the degree, i.e. 
the strength (denoted by s) defined as the total weight of the links of a country (the 
total value of imports and exports for that country). As in the binary case, the av- 
erage strength of the partners of a country (the average nearest neighbour strength, 
denoted by i"") is found to decrease as the strength of that country increases fTT^TSl 
(see fig. 1.3 i. Being a local average over the partners of each country, s™ is strongly 
influenced by the degree, which is a binary property. Similarly, a weighted general- 
ization of the clustering coefficient (denoted by c*") is also influenced by the binary 
structure, since it is still defined on the local neighbourhood of countries. Unlike its 
binary counterpart, c" is found to increase as the strength increases liTTlfTSl (see fig. 

[T3J. 



1.4 Statistical physics and maximum-entropy models 



Taken together, the above findings highlight that the topology of the ITN is nontriv- 
ial and very different from the complete network predicted by the Gravity Model. In 
the previous section we discussed some coincidences and deeper similarities behind 
the use of the gravity law in physics and economics. As another interesting similar- 
ity, recent results 1 16^, Yh J^ 24J suggest that the limitations of the Gravity Model 
can only be overcome after a change of paradigm which is not dissimilar from the 
one that accompanied two revolutions in physics, namely the advent of statistical 
mechanics and that of quantum physics. The new paradigm assumes that, in order 
to predict the presence of a link (and not only its weight), probabilistic models of 
networks need to be considered. The great conceptual jump consists in assuming 
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that the observed network is not unique, but one of many possible realizations, each 
of which has a probability P to occur This probability must be determined by es- 
tablishing which of the properties of the network are somehow 'unavoidable', and 
assuming that all possible networks displaying those properties (including the real 
network) are equally probable. This change of approach is equivalent to the one 
leading to the introduction of statistical physics: if the detailed microscopic con- 
figuration of a large system is unknown (as is generally the case), and only a few 
macroscopic quantities are known (e.g. the total energy), then some properties of 
the system can be inferred by averaging over all possible configurations compati- 
ble with the known macroscopic quantities. The probability of each configuration 
therefore depends on the choice of the macroscopic quantities to be reproduced. 

As Jaynes pointed out in his work devoted to the connections between statisti- 
cal mechanics and information theory |25|, the fundamental problem of statistical 
physics can be regarded as a particular case of a more general class of problems 
of inference from partial information. In the general case, one looks for the prob- 
ability distribution that maximizes the uncertainty about the system, given the par- 
tial knowledge of the latter Mathematically, if C denotes a possible (microscopic) 
configuration of the system, the solution of the problem is obtained maximizing 
Shannon's entropy 

5=-£P(C)lnP(C) (1.3) 

c 

subject to a set of constraints, representing what is known about the system ESJI . 
The result of this constrained maximization problem is the probability 

P{C) = -^- (1.4) 

where li(C) is a linear combination of the constraints (where each constraints is 
coupled to its Lagrange multiplier) and 

Zee^.-«(c) (1.5) 

c 



is the normalization factor. The ensemble of configurations generated by eq.( |1.4| l is 
the maximum-entropy ensemble specified by the chosen constraints. 

Jaynes noticed that, if the system under consideration is a physical one, and 
the only constraint is the total (macroscopic) energy E{C), then H{C) ~ PE{C) 
where )3 is the Lagrange multiplier ensuring that (the ensemble average of) E{C) 
can be set equal to its observed value. This leads to the identification of eq.( |1.4| i 
with the Gibbs-Boltzmann distribution, if j3 is identified with the inverse tempera- 
ture through /3 = {kT)~^ (k being Boltzmann's constant). Automatically, this also 
shows that eq.( |1.5[ l can be identified with the partition function, and eq.(1.3 1 with 
the Gibbs-Boltzmann entropy. 



Coming to the case of networks, it has been shown 11261 1271 1281 I29II that the class 
of network models known in the social sciences under the name of Exponential 
Random Graphs or p* models I.26J is also a particular case of the above maximum- 
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entropy problem. In these models, the constraints are (not necessarily macroscopic) 
topological properties that one wants to control for. Consider for instance the case of 
binary graphs with a given number A^ of vertices. In undirected binary graphs, each 
pair of vertices is either connected or not, with no possible variation in the direction 
and intensity of the connection. Each configuration C is uniquely specified by the 
adjacency matrix A of the graph, defined as a symmetric N xN matrix with entries 
aij = 1 if a link exists between the vertices / and j, and a,,- ~ otherwise. There- 
fore, we can label each configuration with A rather than C, and the corresponding 
probability with P{A). The simplest example is when the only constraint is the total 
number L of links. It has been shown ll26l I28JI that this particular case reduces to 
the popular Erdos-Renyi random graph model, where all pairs of vertices are con- 
nected independently of each other and with the same probability p (which can be 
viewed as the only Lagrange multiplier, uniquely specified by the observed value of 



L). In this model, eq.( 1.4 1 translates into the following probability P{A), that simply 



factorizes over all pairs of vertices /, j: 

p{A)^l\p"Hi-py-"-j=p''^^\i-pf^''-'y^-^^^^ (1.6) 

This expression shows that the generation of an entire graph A is the combination 
oiN{N — l)/2 independent Bernoulli trials^ each corresponding to the creation of 
a single link and characterized by the same success probability p. 

The above simple example shows that individual links naturally inherit, from 
the maximum-entropy structure of the overall model, the character of random vari- 
ables, to be described by probability distributions. If weighted networks are con- 
sidered, maximum-entropy models can still be defined 1.26. ,28J and lead again to a 
probabilistic description of the weight of all links, including the possibility of zero 
weights which correspond to missing links (we will discuss explicit cases later). 
Thus, in both binary and weighted descriptions, the probabilistic character of link 
creation characterizing the maximum-entropy approach eliminates the need to spec- 
ify the topology of the network as 'given', overcoming the limitation encountered 
when using the Gravity Model. This makes maximum-entropy graph ensembles a 
potentially successful approach to the analysis of the ITN and economic networks 
in general. However, two aspects remain to be discussed: 

• one needs of course to check whether a suitable choice of constraints can indeed 
reproduce the empirical properties of the ITN: this requires the identification of 



'^ A Bernoulli trial (or Bernoulli process) is the simplest random event, i.e. one characterized 
by only two possible outcomes. One of the two outcomes is referred to as the 'success' (in this 
case, the creation of a link) and is assigned a probability p. The other outcome is referred to as 
the 'failure', and is assigned the complementary probability 1 — p. Equation l |1.6[ l is indeed the 
product of the probability p^W of L(A) successful events of link creation times the probability 
(1 — p)'^('^^i)/-^^('*) of the complementary number of failures, where L{A) is the number of links 
in the particular graph A. Note that N[N — 1 )/2 is the total number of pairs of N vertices: we are 
uninterested in self-loops, so the diagonal matrix entries are c;„ = 0, which leaves us with only 
N{N — 1 )/2 degrees of freedom in a symmetric N x N adjacency matrix. For the same reason, the 
sum in eq.dl.6| runs over pairs with i < j, i.e. only over the upper triangle of the matrix A. 
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topological constraints that are both reasonable (i.e. they can be a priori justi- 
fied as a meaningful choice) and effective (i.e. they are a posteriori successful in 
replicating the ITN); 



• 



even if the specification of appropriate topological constraints turns out to satis- 
factorily reproduce the observed network, one needs to understand whether this 
result can be reconciled with, or at least related to, the main idea of the Gravity 
Model: the assumption that trade strongly depends on non-topological quantities 
such as GDP and distances. 



In sections 1.5|1.8 we will address the first point in detail, while in sec. 1.9 we will 



deal with the second one. 

So far, our general discussion has highlighted that the transition from the Gravity 
Model to maximum-entropy models is analogous, both conceptually and mathe- 
matically, to the paradigm shift that led to the introduction of Statistical Physics 
at the beginning of the twentieth century. The common aspect in both cases is the 
probabilistic description of the system. As we now show, another common aspect 
leads to a further formal similarity: the discreteness of both economic quantities and 
microscopic particles implies that, when a specific choice of constraints is made, 
eq.( |1.4[ i leads to the same mathematical expressions that are encountered in Quan- 
tum Physics. These expressions are the so-called Fermi-Dirac and Bose-Einstein 
statistics^ 



1.5 Fermi-Dirac statistics 

Let us first consider binary networks, i.e. let us focus only on the presence/absence 
of links. In order to simplify the discussion, let us also consider undirected networks 
(the results that follow can be straighforwardly extended to directed configurations). 
We have already mentioned the example of the random graph model, obtained when 
the only constraint is the total number of links. That model is very simple, but 
severely limited by its complete homogeneity: all vertices have approximately the 
same topological properties, narrowly distributed around a common average value. 
This is in stark contrast with the strong heterogeneity of most real-world economic 



networks, including the ITN as we already discussed in section 1.3 If we want to 



build a maximum-entropy model of the ITN whose topology is a real improvement 



' In quantum physics, fundamental particles are believed to be of two types: fermions or bosons, 
depending on the value of their spin (an intrinsic 'angular moment' of the particle). Fermions have 
half-integer spin and cannot occupy a quantum state (a configuration with specified microscopic 
degrees of freedom, or quantum numbers) that is already occupied. In other words, at most one 
femiion at a time can occupy one quantum state. The resulting probability that a quantum state is 
occupied is known as the Fermi-Dirac statistics. Bosons have integer spin and can occupy states 
with no restriction: any non-negative integer number of bosons can occupy the same quantum state. 
The resulting expected number of particles occupying a given quantum state is described by the 
so-called Bose-Einstein statistics. 
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over the Gravity Model, we need to reproduce the observed heterogeneity of the 
network. To this end, it is necessary to enforce different constraints that lead to 
more complicated models. One of the widespread choices in network theory is to 
consider an ensemble of networks where each vertex / has the same degree A:, as 
in the real network. This choice is justified by the fact that, being an entirely local 
topological property, the degree is expected to be directly affected by some intrinsic 
(non-topological) property of vertices. For instance, we already anticipated that in 
the ITN the degree of a country increases with the GDP of the latter ||9l- It would 
of course not make sense to compare the real ITN with a randomized counterpart 
where the degree of a country no longer corresponds to a realistic value of its GDP 
(for instance, where the USA have only one or two connections). This leads us to 
interpret the observed degrees of countries as 'unavoidable' topological constraints, 
in the sense that the violation of the observed values would lead to an 'impossible', 
or at least very unrealistic, world trade network. 

The resulting model is known as the Configuration Model, and is defined as a 
maximum-entropy ensemble of graphs with given degree sequence 1261 |28i| . The 
degree sequence, which is the constraint defining the model, is nothing but the or- 
dered vector k of degrees of all vertices (where the /th component kj is the degree 
of vertex /). The ordering preserves the 'identity' of vertices: in the resulting net- 
work ensemble, the expected degree {kt) of each vertex / is the same as the empirical 



value ki for that vertex. In the Configuration Model, eq.( 1.4 1 translates into the graph 
probability 

P[A) = Y\qu{aij) = Y\p;-;{l-pijY-"- (1.7) 

'■<;■ '■<;■ 

where qij{a) = p"j{l — PijY" is the probability that that particular entry of the 
adjacency matrix A takes the value Ojj = a. The above expression shows that the 
creation of a link has still the form of a Bernoulli process (see footnote Hh, but now. 



unlike the random graph model described by eq.( 1 .6 1, different pairs of vertices are 



characterized by different connection probabilities pij. These probabilities read 

^U)-P,-Y^ (1.8) 

where x, is the Lagrange multiplier obtained by ensuring that the expected degree of 
the corresponding vertex / equals its observed value: (kj) — k, V/ (lE). Note that, as 



always happens in maximum-entropy ensembles described by eq.(1.4i, the proba- 
bilistic nature of configurations implies that the constraints are valid only on average 
(the angular brackets indicate an average over the ensemble of realizable networks). 
Also note that pij is a monotonically increasing function of jc, and Xj. This implies 
that {ki) is a monotonically increasing function of x,. An important consequence is 
that two countries / and j with the same degree ki = kj must have the same value 

Equation ( |1.8| l provides an interesting connection with quantum physics, and in 
particular the statistical mechanics of the microscopic particles known as fermions 
(see footnote l5]l. The 'selection rules' of fermions dictate that only one particle at 



1 Jan Tinbergen's legacy for economic networks 13 

a time can occupy a single-particle state, exactly as each pair of vertices in binary 
networks can be either connected or disconnected. In this analogy, every pair i,j 
of vertices is a 'quantum state' identified by the 'quantum numbers' / and j. So 
each link of a binary network is like a fermion that can be in one of the available 



states, provided that no two objects are in the same state. Equation (1 .8 i indicates the 
expected number of particles/links in the state specified by / and j. With no surprise, 
it has the same form of the so-called Fermi-Dirac statistics describing the expected 
number of fermions in a given quantum state 1261 l28l |29l . As we already discussed, 
the probabilistic nature of links allows also for the presence of empty states, whose 
occurrence is now regulated by the probability coefficients (1 — pij). 

We now come to the application of the model to the topology of the ITN. 
Unlike the Gravity Model, the Configuration Model allows the whole degree se- 
quence of the observed network to be preserved (on average), while randomizing 
other (unconstrained) network properties ll28l . In order to check whether the model 
successfully reproduces the ITN, one needs to compare the higher-order (uncon- 
strained) observed topological properties with their expected values calculated over 
the maximum-entropy ensemble. This automatically indicates whether the degree 
sequence is informative in explaining the rest of the topology. This can be done 



analytically, by means of the probabilities appearing in eq. (1.8 1 |28|. The effec 



tiveness of the degree sequence in reproducing other topological properties of the 



ITN is shown in fig. 1.4 where we compare the observed values of the average 



nearest neighbour degree k"" and clustering coefficient c, (defined in sec. 1.3 i with 
the corresponding expected values (A:™) and (c,), for all vertices. In this type of 
plot, the agreement between model and observations can be simply assessed as fol- 
lows: the less scattered the cloud of points around the identity function, the better 
the agreement between model and reality. In principle, a broadly scattered cloud 
around the identity function would indicate the little effectiveness of the chosen 
constraints in reproducing the unconstrained properties, signalling the presence of 
genuine higher-order patterns of self-organization, not simply explainable in terms 



of the degree sequence alone. However, the results in fig. 1.4 indicate that the World 
Trade Web is well reproduced by the Configuration Model. This result is very ro- 
bust, as documented by recent analyses that have confirmed it for different temporal 
snapshots, different levels of aggregation (up to individual commodities), and dif- 
ferent datasets |fT6l[T8l . With the appropriate generalizations, this conclusion also 
holds when the ITN is analysed as a directed network, still well described by the 
Fermi-Dirac statistics lfT6l [T9l. 



For completeness, fig. 1.4 also shows the Gravity Model's prediction of a com- 
plete network, which is dramatically different from the observed one. Thus the 
maximum-entropy approach, and in particular the Configuration Model, represents a 
significant advantage with respect to the Gravity Model. An unexpected implication 
is that the degrees of world countries are maximally informative about the ITN as a 
whole: if the empirical degree sequence of the ITN is not reproduced, the observed 
topology of the network as a whole will not be reproduced either |fT6| . Unfortu- 
nately, current micro-founded models in the economics literature do not attempt at 
replicating or explaining the particular value of the number of trade partners of a 
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Fig. 1.4 The topology of the World Trade Web is well reproduced by specifying the number of 
trade partners of each country (binary Configuration Model). Left: observed VS expected average 
nearest neighbour degree k"", for all vertices. Right: observed VS expected clustering coefficient 
c. The red curves are identity lines (perfect agreement). The green curves represent the prediction 
for the same quantities under the Gravity Model, which instead predicts a completely connected 
network with all vertices characterized by the same value. Data: UNCOMTRADE database |21 1, 
year 2001 (N = 162 countries). 



country. Rather, they aim at reproducing the Gravity Model, inspired by the success 
of the latter at the level of non-zero flows ifTSl .In so doing, these models are destined 
to fail in explaining the heterogeneous topology of the ITN. To overcome this lim- 
itation, future models should aim at replicating the degree sequence explicitly. As 
we anticipated at the end of sec. |1.4| one should also investigate the relationship be- 
tween the 'empirically informative constraints' in the maximum-entropy approach 
(the degree sequence in this case) and the 'macroeconomic explanatory factors' in 
the Gravity Model approach (such as GPD and distances). We will discuss this im- 
portant point in sec. 



We conclude this section by stressing again that the 'fermionic' character of the 
ITN, when treated as a binary network, is the mere result of the restriction that no 
two binary links can be placed between any two vertices, leading to a mathematical 
result which is formally equivalent to the one of quantum statistics. Clearly, there is 
nothing really 'quantum' in trade connections being described by the Fermi-Dirac 
statistics, exactly as there is nothing really 'gravitational' in non-zero trade flows 
being described by the Gravity Model. Still, the deep epistemological analogies 



leading to similar laws in physics and economics (the ones we discussed in sec. 1.2 1 



remain, and are now translated into a more sophisticated formalism that allows for 
the probabilistic and discrete nature of the system: 

• In both physical and economic applications, the Fermi-Dirac statistics has the 
following symmetry: the probability pij only depends on the combination x,Xj. In 
quantum physics, XjXj in turn depends only on the energy of the quantum state, 
while in the Configuration Model it depends only on the end-point degrees ki 
and kj. This means that, all else being equal (e.g. given the same energy, or the 
same values of the end-point degrees), the occupation probability of two differ- 
ent states {i,j) and {m,n) is the same. 



1 Jan Tinbergen's legacy for economic networks 15 

• In both applications, the Fermi-Dirac statistics implements the qualitative idea 
that, the larger the value of XiXj, the higher the probability that the state {i,j) is 
occupied. 

• Finally, the validity of the Fermi-Dirac statistics is in both cases established a 
posteriori, by the fact that it reproduces empirical observations. This phenomeno- 
logical agreement confirms that the postulated quantum numbers/topological 
constraints, which uniquely specify the values {x,}, are indeed the (only) rele- 
vant ones for the problem under investigation. 



As we will discuss in sec. 1.9 in the ITN the value of XjXj can also be related to the 
GDP of the two countries / and j, and to the geographic distance separating them. 
This restores a tight correspondence between the three points listed above and the 
three ones discussed in section [L2l 



1.6 Bose-Einstein statistics 

We started this chapter stressing the importance of the ITN as a complex weighted 
network, while in the previous section we restricted ourselves to the description of 
its purely binary topology. From this point onwards, we go back to the full weighted 
level and discuss whether it is possible to reproduce the topology and weights of 
the ITN simultaneosly. Naively, the results shown so far suggest that a first attempt 
in this direction could be the introduction of a two-step process where the topology 
is first established using the Configuration Model, and the realized link weights are 
then estimated using the Gravity Model. However, besides being disappointingly 
inelegant, this approach would result in a hybrid combination of maximum-entropy 
and economically inspired expectations, leaving the final results without a clear in- 
terpretation. A more satisfactory way to proceed is expanding the maximum-entropy 
formalism into one valid for weighted networks, automatically closer to the macroe- 
conomic reasoning. 

To many economists, the finding that the number of trade partners of a country 
is a particularly informative quantity might appear weak or misleading, given the 
expectation that the monetary value of imports and exports is in principle much more 
informative: common sense suggests that knowing how much (in dollars) a country 
trades with the rest of the world should be more informative than just knowing 
how many partners trade with that country. This leads to the expectation that the 
strength s should be more informative than the degree k. One might therefore suspect 
that an even better model of the ITN, still preserving the observed heterogeneity 
of countries, is one where the strengths, rather than the degrees, are enforced as 
constraints. In this section, we take this approach and show that it actually leads to 
a counter-intuitive result: unlike what we found for the degree sequence, knowing 
the strength of each world country turns out to be very poorly informative about the 
structure of the ITN as a whole. 
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The theoretical framework introduced in the previous section allowed us to treat 
links as probabilistic entities in order to overcome the Gravity Model's prediction 
of a completely connected network. Since we were only interested in the prediction 
of the presence or absence of links, the selection rules were formally analogue to 
the fermionic ones: Ojj = 0, 1. However, the formalism can be generalized in order 
to analyze weighted networks where links can have non-negative integer weights. If 
we keep considering undirected graphs for simplicity, each network is now specified 
hy a N X N symmetric weight matrix W whose entry Wjj equals the weight of the 
link between the vertices / and j. Therefore, we can now label each configuration 
with W, and the corresponding probability with P{W). If we define the strength se- 
quence s as the ordered vector of strength values (with components s^, i~l,.. . ,N), 
the Weighted Configuration Model can be introduced as the ensemble of weighted 
networks with given strength sequence. If one allows each Wij to take non-negative 
integer numbers, eq.(|1.4[i now becomes 



i<j i<j 

Where qij{w) = pj^(l — Pij), which has now the form of a geometric distributional 
is the probability that the vertices / and j are connected by a link of weight w. The 
outcome w = 0, corresponding to a missing link, occurs with probability 1 —/?,;. 
Therefore ptj still denotes the probability that / and j are connected, irrespective of 
the weight of this connection. This probability now reads 

Pij=yiyj (1-10) 

where yt is the Lagrange multiplier required in order to ensure that the expected 
strength (,?,) of each vertex / equals the empirical value Sj 128]. This time, two coun- 
tries / and j with the same strength si — Sj (independently of their degrees) must 
have the same value y, — yj jTTl . 

As in the previous case, a connection with another well-known quantum statistics 
emerge. The 'selection rules' have now allowed us to treat link weights as formally 
analogue to bosons (see footnote [5]l, admitting multiple and unlimited occupations 
of the same 'quantum state' (w,,- = 0, 1,2... +°o). Indeed, the expected occupation 
number of a quantum state, which is the expected weight of the link between vertices 
i and j, is now formally identical to the so-called Bose-Einstein statistics j 



K) = ^ = ^^^ (1.11) 

As before, there is nothing fundamental in the mathematical analogy with quantum 
statistics, the only common element being the postulated discreteness of the num- 



* In one of its possible formulations, the geometric distribution describes the probability that, in a 
sequence of repeated Bernoulli trials (see footnote|4l with success probability p, the first w trials 
are all successful and the following one is unsuccessful. This happens with probability p"' ( 1 — p) . 
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Fig. 1.5 The topology and weights of the World Trade Web are NOT well reproduced by specifying 
the total import and export value of each country (weighted Configuration Model). Left: observed 
VS expected average nearest neighbour strength s"", for all vertices. Right: observed VS expected 
weighted clustering coefficient c"'. The red curves are identity lines (perfect agreement). The green 
curve represents the prediction for the same quantities under the Gravity Model, which instead 
predicts a completely connected network with all vertices characterized by the same value. Data: 
UNCOMTRADE database ||21J, year 2001 (A' = 162 countries). The values are rescaled by the 
total weight. 



bers w,; PI The deeper similarity involves again the concept of symmetry, which in 
this case refers to the assumption that, all else being equal, in the Bose-Einstein 
statistics the expected value (w,j) only depends on ytyj. Similarly, the common 
qualitative aspect shared by the physical and economic applications is that (vv,y) is 
expected to increase with ytyj. However, as we now show, this time the phenomeno- 
logical analogy between the Weighted Configuration Model of trade and quantum 
statistics (i.e. the agreement of both with real data) breaks down: while the Bose- 
Einstein distribution describes the microscopic world of bosons remarkably well, 
eqs.( 1.9|fl.ll| l fail miserably in reproducing the observed ITN. 

To see this, one can again compare the observed and the expected values of 



higher-order (unconstrained) topological properties. In fig. 1.5 we show the average 
nearest neighbour strength s"" and weighted clustering coefficient c" (see sec. 1.3 1. 



The results now indicate how bad the accordance between the Weighted Configu- 
ration Model and the real network is. Interestingly enough, the model's prediction 
for the average nearest neighbors strength is almost identical to the Gravity Model's 
prediction for the same quantity. This unambiguously indicates that the two mod- 
els suffer from the same limitation: their incapabaility to reproduce the topology 
of the observed network and, in particular, the fact that the Weighted Configura- 
tion Model generates an extremely dense network fTTl, not too different from the 
completely connected topology predicted by the Gravity Model (for a fully con- 
nected network whose weights are rescaled by the total weight, it is easy to estimate 
s"" ~ Y.i'''i/N = 2/A^). Similarly, even if the smaller values of the weighted clus- 
tering coefficient seem to partially agree with the model's prediction, the behaviour 
for large values indicates that major refinements are needed in order to improve the 



' In quantum physics, the discreteness is implied by the fact that particles can only exist in integer 
number. In economic networks, the discreteness is implied by the fact that money can only exist in 
integer multiples of a fundamental, indivisible unit of currency (such as one Eurocent). 
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model performance ifTTl . The disagreement between the Weighted Configuration 
Model and the real ITN has been confirmed on different data, different temporal 
snapshots, and different commodities ifTTlfTSl . 

The above result contradicts the intuitive expectation that, by taking a weighted 
quantity (the strength sequence s) as input, the Weighted Configuration Model 
should be more informative than its binary counterpart. Indeed, while the complete 
knowledge of all the weights of the network is of course more informative than the 
knowledge of the binary topology alone, it turns out that the partial knowledge of 
the weighted network (the strength sequence in this case) is less informative than the 
knowledge of the corresponding binary quantity (the degree sequence). A somewhat 
puzzling consequence for macroeconomic modeling is that, even if a micro-founded 
model of international trade successfully reproduces the observed total imports and 
exports of all world countries, this is definitely not enough in order to reproduce 
the structure of the ITN as a whole. If combined with the previous result about the 
extreme informativeness of the degree sequence, this finding strengthens the uncon- 
ventional conclusion that satisfactory models of international trade should aim at 
primarily reproducing the binary properties of countries (number of trade partners) 
rather than their weighted ones (total import and export) ||16II17|[T8]| . 



1.7 Generalized quantum distributions 

One therefore needs to look for a better model of the International Trade Network, 
able to reproduce the binary topology and the weighted structure of the network si- 
multaneously. Since microscopic particles are either fermions or bosons, the Fermi- 
Dirac and Bose-Einstein distributions are the only two types of quantum statistics 
traditionally used in physicsj^However, one can formally introduce generalized dis- 
tributions that reduce to the Fermi-Dirac and the Bose-Einstein statistics as partic- 
ular cases. While the physically realizable quantum systems might correspond only 
to the fermionic and bosonic extremes, it might well be that other systems, such as 
economic networks, can instead realize other non-trivial limits of those generalized 
distributions. Therefore, in this and in the next section we discuss two possible gen- 
eralized 'quantum' statistics and their relationship with the structure of economic 
networks, and the ITN in particular. 

The case we consider in this section is just a pedagogical example, while the 
one we discuss in the next section leads to a very important model that reproduces 
the observed ITN in great detail. As we showed, the only mathematical ingredi- 
ent needed to generate the Fermi-Dirac statistics in maximum-entropy network en- 
sembles is the 0/1 character of binary links iaij = 0, 1), while the only ingredient 
needed to generated the Bose-Einstein statistics is the non-negative integer char- 
acter of weighted links (w,; = 0, . . . , +°°). For non-physical systems, there is no 

^ Excluding the case of any ons f30l, particles that can only exist in two dimensions and that can be 
described by a generalized 'fractional statistics' tSlJ. which is however unrelated to the extensions 
we discuss in this section and in the next one. 
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Fig. 1.6 Single-link cumu- 
lative distribution functions 
of the generalized statistics, 
defined in eq.JTTTS}, rang- 
ing from the Fermi-Dirac 
(the blue one on the left, 
with Wmax = 1) to the Bose- 
Einstein (the blue one on 
the right, with w,,,,,, — i- °°). 
The X-axis is logarithmically 
scaled. 




reason why the only two allowed values for the maximum weight should be one 
and infinite. In general, we can consider a general family of distributions, obtained 
when the occupation number can range from to a finite maximum value w,„ax- 
All the distributions within this family share the same discrete character, due to 
the integer occupation numbers of 'quantum states'. However, they only reduce to 
the Fermi-Dirac and Bose-Einstein distributions in the extreme cases w,„ax = 1 and 
Wmax —>■ +°° respectively. 

In the general case, with w,y =0, 1,2... Wmax, the maximum-entropy ensemble 
of networks with given strength sequence s is characterized by the probability dis- 
tribution 



PiW) =U^ijiw,j) =11 






l-Jij 



1 \ '^ 



(^l-'max I 1 ) 



I-n 



(1.12) 



where for simplicity we have defined ytj = yiyj, if y; still denotes the Lagrange mul- 
tiplier needed to enforce the constraint (,?,) = i,. Like eqs. ( |L7| and ( L9 1, the above 
probability is still a product over single-link distributions, each characterized by the 
same, bounded range of values. In order to better visualize the functional form of 
such distributions, the corresponding single-link cumulative distribution functions 
can be plotted, as shown in fig.|L6| The latter can be computed quite easily as 



i^vK >o) = i -<?,-, (0) = 



yiji}-ylr) 



i-yi 
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(1.13) 



As Wmax increases from 1 to +o°, the intersections of these distributions with the 
y-axis form an interesting numerical succession, whose generic term is 



^,;K>Obo = l) 



W„ 



1 2 3 



(1.14) 



and whose limit when Wmax -^ +°° is 1 • 

The generalized distribution considered above is an example showing how it 
is possible to gradually interpolate between the Fermi-Dirac and Bose-Einstein 
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through the introduction of an extra parameter {w,nax)- In principle, intermediate 
values of the parameter can lead to different results than the ones we showed in 
sec. 1 .6 and potentially to an improvement over them. However, since the observed 
weights in the ITN are extremely large, the value of w„„„ required in order to gener- 
ate realistic weights will be so large that the results are practically indistinguishable 
from those we have already discussed using the Bose-Einstein distribution. So this 
model does not represent a real improvement. 



1.8 Mixed Bose-Fermi statistics 

A fundamentally different way to unify the Fermi-Dirac and Bose-Einstein distri- 
butions, other than introducing an extra parameter while keeping the strength se- 
quence s as the constraint, is adopting a different choice of the constraint them- 
selves. Specifically, motivated by the success of the binary Configuration Model 
discussed in sec. |1.6[ one can introduce a maximum-entropy ensemble of networks 
with given degree sequence k and strength sequence s, i.e. one where both con- 
straints are enforced simultaneously ||32]| . To this end, the maximum allowed value 
of weight is again w,nax = +°° as in the Bose-Einstein case. However, in terms of 
theoretical models, this leads to a whole new family of probability distributions, 
whose functional form is ||32]| 






ixiXjY-Jyiyj{yiyjpJ-'{l-yiyj) 
l-yiyj+XiXjyiyj 



(1.15) 



where a,^, the element of the adjacency matrix, is if Wij — and 1 if Wij > 0. In 
the above expression, the x vector controls for the degrees and the y vector controls 
for the strengths. This double set of constraints implies that, while different pairs 
of vertices are still independent, the creation of a link of given weight between two 
vertices is neither a Bernoulli nor a geometric process (see footnotes HI and l6]), but a 
combination of the two. 

As in the previous example, the (now generalized) 'quantum' or discrete charac- 
ter of the statistics becomes evident as soon as the expected occupation numbers are 
computed ll32l : 

1 - ytyj + XiXjyiyj ( 1 - yiyj + XiXjytyj ) ( 1 - yjyj ) 

In this case, the unification of the Fermi-Dirac and Bose-Einstein distributions is 
achieved by combining binary and weighted constraints 'as a block', i.e. in a sin- 
gle big step. As a result, one cannot gradually interpolate between the two ordi- 
nary statistics: for instance, in order to retrieve the Bose-Einstein distribution, one 
must drop the entire degree sequence k from the set of constraints (mathematically. 
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this corresponds to set x equal to the unit vector l)|j The fundamental difference 



with respect to the intermediate distributions defined by eq. (1.12 1 is that now the 
occupation probability of an empty 'single-link state' differs from the occupation 
probability of an already occupied state. In fact, this family of distributions can be 
intuitively described by saying that the 'first' appearance of a link of unit weight be- 
tween two disconnected vertices is regulated by the Fermi-Dirac statistics, while the 
'subsequent' appearance of units of weights between two already connected vertices 
is regulated by the Bose-Einstein statistics. For this reason, the statistics defined by 



eq.( 1.16 1 is called the mixed Bose-Fermi statistics |32|. 

The Bose-Fermi statistics reproduces with great accuracy all the four higher- 
order structural quantities (both binary and weighted) of the ITN considered so far. 
This is shown in fig. |1.7| where we compare the expected and observed values of 
knn, c, s™, and c" for all vertices. For the first time, we find a very close agreement 
for all these topological properties simultaneously. This result is very robust, as it 
holds for different snapshots and different commodities ll24l . Two main conclusions 
can be drawn: 

• on one hand, the addition of weighted constraints to the binary ones does not 
affect the effectiveness of the mixed model in reproducing the purely topological 
properties: the two upper panels of fig. |1.7|look approximately the same as the 



two panels of fig. 1 .4 



on the other hand, the addition of binary constraints to the weighted ones dra- 
matically improves the performance of the model in predicting purely weighted 



quantities. This is evident by comparing fig. 1.5 with the two bottom panels of 
fig. |1.7[ The latter show that now both the average nearest neighbour strength 
and the weighted clustering coefficient closely follow the identity function. In a 
sense, the addition of purely binary constraints compensates the incapability of 
the purely bosonic model in reproducing the network structure and brings the 
model back to high levels of performance at the topological level. 

The family of mixed Bose-Fermi statistics not only represents a powerful model in 
order to explain both the binary and the weighted quantities of interest of a given, 
observed network; it also points out the strong effects of the underlying topology on 
the weighted structural patterns. 

In economic terms, our discussion leads to the conclusion that the knowledge 
of monetary/weighted structural properties (such as total imports and exports) is 
informative only if in combination with non-monetary/binary properties (such as 
the number of trade partners). The expectation that monetary quantities are per se 
more informative than the corresponding binary ones turns out to be incorrect. For 
this reason, we believe that the Bose-Fermi statistics is a very useful tool in the 
understanding of economic networks in general. The fact that both strengths and 



' Note that the dual operation, i.e. dropping the entire strength sequence s from the set of constraints 
(mathematically corresponding to y = 1), leads to an undefined model and does not correspond to 
the Fermi-Dirac statistics. The reason is that the maximum weight is still w„u,x = +°° and not 
Wmax = 1: without constraints on the weighted properties, the expected weights become infinite. 
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Fig. 1.7 The topology and weights of the World Trade Web are simultaneously well reproduced 
by specifying the number of partners of each country, as well as the total import and export value 
of each country (mixed Configuration Model). Top left: observed VS expected average nearest 
neighbour degree k"" , for all vertices. Top right: observed VS expected clustering coefficient c. 
Bottom left: observed VS expected average nearest neighbors strength s"" . Bottom right: observed 
VS expected clustering coefficient c'"\ The red curves are identity lines (perfect agreement). The 
green curves represent the prediction for the same quantities under the Gravity Model, which in- 
stead predicts a completely connected network with all vertices characterized by the same value. 
Data: UNCOMTRADE database |21 1, year 2001 (W = 162 countries). The values are rescaled by 
the total weight. 



degrees are enforced allows to study the interplay between the topological and mon- 
etary levels of organization, while still keeping the model parsimonious: only local 
(country-specific) structural properties, the ones that we discussed as the somewhat 
irreducible and 'unavoidable' level of heterogeneity, are enforced. 



1.9 The role of GDP and distance 



The results presented so far show that the attempt to model the International Trade 
Network using analogies with physical laws, initiated by Jan Tinbergen with the in- 
troduction of the Gravity Model, turns out to be extremely successful, even if the 
appropriate formal expressions are very different from Tinbergen's original idea. 
However, to complete our discussion, we need to address the final point anticipated 
i.e. how to reconcile maximum-entropy graph ensembles (that take struc- 



in sec. 



1.4 



tural properties as input) with the Gravity Model's expectation that international 



1 Jan Tinbergen's legacy for economic networks 23 

trade strongly depends on non-structural macroeconomic properties, such as GDP 
and geographic distances. 

We start with a discussion about the role of GDP. As we anticipated in sections 



1.3 and 1.5 the GDP of a country turns out to be highly correlated with the degree 
of that country in the ITN. Interestingly, the functional dependence of the degree on 
the GDP can be adequately characterized by eq.( |1.8| l. In more detail, it was shown 
191 that the connection probabiUty 

zGDPjGDPj 
^''~ l+zGDPiGDPj ^^-^'^ 

(where z is a global free parameter) defines a model that reproduces the properties of 
the binary topology of the ITN very well, just like the Configuration Model defined 
by eq.( |1.8| l does. The value of z is fitted to the data b y re quiring that the expected 
number of links (L) equals the observed number L |9 iPjThis result shows that the 
parameter x, is approximately proportional to GDPi, as can be confirmed explicitly 
ll33l . In terms of the network formation process, this means that the results discussed 
in sec. 1.5 can be almost entirely rephrased as follows. The GDP is found to deter- 
mine directly the number of trade partners of each country (because GDPi has the 
same role of the Lagrange multiplier jc, determining A:,), and indirectly the whole 
topology of the International Trade Network (because the functional form of the 
connection probability is that of the Configuration Model, where the higher-order 
topological properties are entirely determined by the degree sequence). The only 
topological quantity we need to know about the real network is the total number of 
links specifying the parameter z- 

In an only slightly more complicated way, it is also possible to incorporate dis- 



tances into eq. (1.17 1 |34J. This leads to the probability 



zGDPiGDPje-^f^''-J^ 
1+zGDPiGDPje- 



P'^- ..-^:.^L^-rm, (1-18) 



where f{dij) is some increasing function of the geographic distance between coun- 
tries i and j. The simplest choice for this function is f{dij) = dtj 1351 . The model 
has now two parameters, which can be fixed simultaneously by imposing (L) = L 
and (F) = F, where F = Y.tj'^ijfidij) is a measure of the filling of space by the 
network l35| . A variant of this model has been recently used to analyse the directed 
version of the ITN |34|. The result one finds is that the addition of spatial infor- 
mation moderately improves the fit to the data. However, alternative models that 
include information about the reciprocity of trade lT9ll36ll . rather than geographic 
distances, systematically outperform the spatial model l34l . 

We note that, along the same lines as above, it is straightfoward to introduce 
GDPs and distances also in the weighted models defined by eq.(|l.ll[) and ( 1.16 1, by 



'" This clioice for tlie parameter z corresponds to the maximization of the Ukelihood of the model 
defined by eq.jl.l7| |33|, exactly like the values of {x,} that realize the conditions (kj) = kt V/ 



maximize the likelihood of the Configuration Model defined by eq.jl.8b I i28ll33 
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simply replacing ytyj and xixj with zGDPt GDPj e^^^^'^'J' . Even if these weighted 
models have not yet been used in emprical analyses, the above discussion shows that 
maximum-entropy ensembles are not per se incompatible with the Gravity Model's 
approach of explaining trade patterns in terms of macroeconomic quantities such 
as GDP and distances. On the contrary, we believe that maximum-entropy models 
are a very promising tool to understand economic networks. Identifying the most 
informative properties explaining the topology and weights of real economic net- 
works is extremely important in order to identify the most relevant 'targets' of the- 
oretical models. The finding that the observed trade patterns cannot be adequately 
understood unless one is able to reproduce the degree sequence, and that the latter 
is directly determined by the GDP, could only be established using a maximum- 
entropy model. More in general, the important role played by binary properties even 
in weighted analyses is a highly nontrivial result. 



1.10 Conclusions 

In 1962, in what we would now call a pioneering attempt to model economic net- 
works, the physicist and first Nobel Memorial Prize laureate Jan Tinbergen intro- 
duced the Gravity Model of trade mimicking Newton's gravitation law. This very 
intuitive and elegant proposal aimed at explaining trade exchanges in terms of a few 
macroeconomic quantities (GDP and geographic distance) by combining them in 
the same way as nature combines gravitational masses and spatial distances. 

The success of the Gravity Model is due to the fact that it reproduces well the 
observed (non-zero) trade flows between countries. Minor refinements to the model, 
such as the inclusion of additional factors either favouring or suppressing trade, are 
relatively simple to make and further improve the fit to the data. Therefore, for half 
a century the Gravity Model has been used more and more extensively in macroe- 
conomic analyses, and it has become the standard model of international trade in 
the economics literature. However, the most serious and in some sense irreducible 
limitation of the Gravity Model emerged only relatively recently, after the publi- 
cation of several empirical analyses documenting the topology of the International 
Trade Network in the statistical physics literature. While the Gravity Model predicts 
a completely connected network where every country trades with all other countries, 
the observed ITN is much more heterogeneous and hierarchical. 

We have shown that this limitation can be overcome by adopting a probabilis- 
tic view, in exactly the same way as classical physics escaped its crisis at the end 
of XIX century by adopting the quantum-mechanical paradigm. In network theory, 
this amounts to consider the adjacency matrix entries as probabilistic entities and 
the node pairs as single-link states whose occupation numbers are regulated by the 
same selection rules that apply to fermions and bosons in quantum physics. In this 
way, various probability distributions can be defined in order to explain the observed 
structural patterns. On one hand, purely fermionic selection rules excellently repro- 
duce the binary topology of the ITN, but are intrinsically limited by the fact that 
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they give no information about the weights of Hnks in the network. On the other 
hand, bosonic selection rules are suitable for weighted analyses but suffer from the 
same limitations of the Gravity Model, since they lead to the prediction of an almost 
completely connected network. Interestingly enough, the most effective probabilis- 
tic models are those combining fermionic and bosonic selection rules. In this combi- 
nation, the limitations encountered when the two quantum statistics are considered 
as separate are overcome simultaneously, and both the topology and weights of the 
observed ITN are nicely reproduced. 

The main conclusion we can draw is the fundamental role played by topology in 
explaining the observed patterns of real world networks: in contrast with the 'main- 
stream' economic thinking, purely weighted information (such as that encoded into 
the strength sequence) is not enough to reproduce all the observed properties and, in 
particular, the purely binary ones (such as the degree sequence). A genuine, purely 
binary information is also needed from the very beginning, as confirmed by the 
successful family of mixed Bose-Fermi probabilistic distributions. This shows that 
the naive expectation that weighted/monetary quantities are per se more informative 
than the corresponding binary/non-monetary ones is incorrect. The counter-intuitive 
nature of this finding shows that it is very important to further develop an appropriate 
information-theoretic formalism, based on maximum-entropy statistical ensembles, 
aimed at identifying the key structural properties of economic networks. 

Curiously, the road taken by Jan Tinbergen appears to lead to 'physical' laws that 
are quite different from the ones originally postulated, and more similar to quantum 
statistics than gravitation. However, the deep epistemological reasons underlying 
Tinbergen's idea of introducing the Gravity Model of trade appear to be very appro- 
priate, and persist throughout the more recent approaches. This is, we believe, the 
most important legacy that Jan Tinbergen left us for the modern understanding of 
economic networks. 
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